Prediction trees with soft nodes for binary outcomes.

نویسندگان

  • Antonio Ciampi
  • André Couturier
  • Shaolin Li
چکیده

Consider the problem of predicting the occurrence of an event, the onset of diabetes mellitus, say, from a vector of continuous and discrete predictors. We propose a new algorithm for the construction of a tree-structured predictor for the event of interest, which uses a new approach for dealing with continuous predictors. The novelty is that the tree uses splits for continuous variables. This means that at each node an individual goes to the right branch with a certain probability, function of a predictor. The predictor as well as the particular shape of the function is chosen from the data by the proposed algorithm. We evaluate its performance on several real data sets, in particular comparing it with a standard tree-growing algorithm. We also present an analysis of a well-known data set, the Pima Indian diabetes data set, to illustrate the application of the method in biostatistics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

پیش‌بینی پارامترهای امواج ناشی از باد در دریای خزر با استفاده از روش درختان تصمیم رگرسیونی و شبکه های عصبی مصنوعی

Prediction of wave parameters is necessary for many applications in coastal and offshore engineering. In the literature, several approaches have been proposed to wave predictions classified as empirical based, soft-computing based and numerical based approaches. Recently, soft computing techniques such as Artificial Neural Networks (ANNs) have been used to develop wave prediction models. In thi...

متن کامل

Bagging Soft Decision Trees

The decision tree is one of the earliest predictive models in machine learning. In the soft decision tree, based on the hierarchical mixture of experts model, internal binary nodes take soft decisions and choose both children with probabilities given by a sigmoid gating function. Hence for an input, all the paths to all the leaves are traversed and all those leaves contribute to the final decis...

متن کامل

Bayesian analysis of binary prediction tree models for retrospectively sampled outcomes.

Classification tree models are flexible analysis tools which have the ability to evaluate interactions among predictors as well as generate predictions for responses of interest. We describe Bayesian analysis of a specific class of tree models in which binary response data arise from a retrospective case-control design. We are also particularly interested in problems with potentially very many ...

متن کامل

A New Heuristic Algorithm for Drawing Binary Trees within Arbitrary Polygons Based on Center of Gravity

Graphs have enormous usage in software engineering, network and electrical engineering. In fact graphs drawing is a geometrically representation of information. Among graphs, trees are concentrated because of their ability in hierarchical extension as well as processing VLSI circuit. Many algorithms have been proposed for drawing binary trees within polygons. However these algorithms generate b...

متن کامل

Lower Bounds on Quantum Query Complexity for Read-Once Decision Trees with Parity Nodes

We introduce a complexity measure for decision trees called the soft rank, which measures how wellbalanced a given tree is. The soft rank is a somehow relaxed variant of the rank. Among all decision trees of depth d, the complete binary decision tree (the most balanced tree) has maximum soft rank d, the decision list (the most unbalanced tree) has minimum soft rank √ d, and any other trees have...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Statistics in medicine

دوره 21 8  شماره 

صفحات  -

تاریخ انتشار 2002